Unsupervised Multi-View Feature Selection via Co-Regularization
نویسنده
چکیده
Existing unsupervised feature selection algorithms are designed to extract the most relevant subset of features that can facilitate clustering and interpretation of the obtained results. However, these techniques are not applicable in many real-world scenarios where one has an access to datasets consisting of multiple views/representations e.g. various omics profiles of the patients, medical text records coupled with FMRI images, etc. In this paper, we introduce a novel unsupervised multi-view feature selection algorithm that can simultaneously extract a relevant subset of features and perform clustering that is consistent across different views. By leveraging information from these different views we obtain more robust and accurate results in comparison to traditional methods. Our algorithm works by co-regularizing the multiple clustering hypothesis and at the same time learning the feature ranking matrix. In the empirical evaluation on high dimensional multi-view metagenome data we demonstrate the applicability and efficacy of the proposed algorithm. Last but not least, a Python implementation of our algorithm is available for download, easy to setup and use.1
منابع مشابه
Robust Matrix Elastic Net based Canonical Correlation Analysis: An Effective Algorithm for Multi-View Unsupervised Learning
This paper presents a robust matrix elastic net based canonical correlation analysis (RMEN-CCA) for multiple view unsupervised learning problems, which emphasizes the combination of CCA and the robust matrix elastic net (RMEN) used as coupled feature selection. The RMEN-CCA leverages the strength of the RMEN to distill naturally meaningful features without any prior assumption and to measure ef...
متن کاملStructure Regularized Unsupervised Discriminant Feature Analysis
Feature selection is an important technique in machine learning research. An effective and robust feature selection method is desired to simultaneously identify the informative features and eliminate the noisy ones of data. In this paper, we consider the unsupervised feature selection problem which is particularly difficult as there is not any class labels that would guide the search for releva...
متن کاملAdaptive Unsupervised Multi-view Feature Selection for Visual Concept Recognition
To reveal and leverage the correlated and complemental information between different views, a great amount of multi-view learning algorithms have been proposed in recent years. However, unsupervised feature selection in multiview learning is still a challenge due to lack of data labels that could be utilized to select the discriminative features. Moreover, most of the traditional feature select...
متن کاملA Survey on Multi-View Clustering
With advances in information acquisition technologies, multi-view data become ubiquitous. Multi-view learning has thus become more and more popular in machine learning and data mining fields. Multi-view unsupervised or semi-supervised learning, such as co-training, co-regularization has gained considerable attention. Although recently, multi-view clustering (MVC) methods have been developed rap...
متن کاملFeature Selection via Joint Embedding Learning and Sparse Regression
The problem of feature selection has aroused considerable research interests in the past few years. Traditional learning based feature selection methods separate embedding learning and feature ranking. In this paper, we introduce a novel unsupervised feature selection approach via Joint Embedding Learning and Sparse Regression (JELSR). Instead of simply employing the graph laplacian for embeddi...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2015